3 research outputs found

    Semi-automated curation of protein subcellular localization: a text mining-based approach to Gene Ontology (GO) Cellular Component curation

    Background: Manual curation of experimental data from the biomedical literature is an expensive and time-consuming endeavor. Nevertheless, most biological knowledge bases still rely heavily on manual curation for data extraction and entry. Text mining software that can semi- or fully automate information retrieval from the literature would thus provide a significant boost to manual curation efforts. Results: We employ the Textpresso category-based information retrieval and extraction system (http://www.textpresso.org), developed by WormBase, to explore how Textpresso might improve the efficiency with which we manually curate C. elegans proteins to the Gene Ontology's Cellular Component Ontology. Using a training set of sentences that describe results of localization experiments in the published literature, we generated three new curation task-specific categories (Cellular Components, Assay Terms, and Verbs) containing words and phrases associated with reports of experimentally determined subcellular localization. We compared the results of manual curation to those of Textpresso queries that searched the full text of articles for sentences containing terms from each of the three new categories plus the name of a previously uncurated C. elegans protein, and found that Textpresso searches identified curatable papers with recall and precision rates of 79.1% and 61.8%, respectively (F-score of 69.5%), when compared to manual curation. Within those documents, Textpresso identified relevant sentences with recall and precision rates of 30.3% and 80.1% (F-score of 44.0%). From returned sentences, curators were able to make 66.2% of all possible experimentally supported GO Cellular Component annotations with 97.3% precision (F-score of 78.8%).
Measuring the relative efficiencies of Textpresso-based versus manual curation, we find that Textpresso has the potential to increase curation efficiency by at least 8-fold, and perhaps as much as 15-fold, given differences in individual curatorial speed. Conclusion: Textpresso is an effective tool for improving the efficiency of manual, experimentally based curation. Incorporating a Textpresso-based Cellular Component curation pipeline at WormBase has allowed us to transition from strictly manual curation of this data type to a more efficient pipeline of computer-assisted validation. Continued development of curation task-specific Textpresso categories will provide an invaluable resource for genomics databases that rely heavily on manual curation.
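
The F-scores quoted in the abstract above are consistent with the harmonic mean of recall and precision. The following is a minimal Python sketch under that assumption: the abstract does not say which F-score variant was used, and `f_score` is an illustrative helper name, not part of Textpresso.

```python
def f_score(recall: float, precision: float) -> float:
    """Balanced F-score (harmonic mean) of recall and precision.

    Inputs and output share the same units (here, percent).
    Assumes the balanced F1 definition; the abstract does not specify it.
    """
    if recall + precision == 0:
        return 0.0
    return 2 * recall * precision / (recall + precision)


# Recall/precision pairs (in percent) as quoted in the abstract.
reported = {
    "curatable papers": (79.1, 61.8),
    "relevant sentences": (30.3, 80.1),
    "GO annotations": (66.2, 97.3),
}

for task, (recall, precision) in reported.items():
    print(f"{task}: F = {f_score(recall, precision):.1f}%")
```

Recomputing from the rounded percentages reproduces the sentence-level and annotation-level F-scores (44.0% and 78.8%). The paper-level value comes out about 0.1 point below the reported 69.5%, presumably because the reported figure was derived from unrounded counts.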

    Helical ordering of envelope‐associated proteins and glycoproteins in respiratory syncytial virus

    Human respiratory syncytial virus (RSV) causes severe respiratory illness in children and the elderly. Here, using cryogenic electron microscopy and tomography combined with computational image analysis and three-dimensional reconstruction, we show that there is extensive helical ordering of the envelope-associated proteins and glycoproteins of RSV filamentous virions. We calculated a 16 Å resolution sub-tomogram average of the matrix protein (M) layer that forms an endoskeleton below the viral envelope. These data define a helical lattice of M-dimers, showing how M is oriented relative to the viral envelope. Glycoproteins that stud the viral envelope were also found to be helically ordered, a property that was coordinated by the M-layer. Furthermore, envelope glycoproteins clustered in pairs, a feature that may have implications for the conformation of fusion (F) glycoprotein epitopes that are the principal target for vaccine and monoclonal antibody development. We also report the presence, in authentic virus infections, of N-RNA rings packaged within RSV virions. These data provide molecular insight into the organisation of the virion and the mechanism of its assembly.

    Yorba Times: Special Edition on Safety

    During the Spring 2016 semester, Dr. Noah Asher Golden's Teaching of Writing K-12 students partnered with the Journalism class at Yorba Academy for the Arts. Through collaboration over a four-month period, Chapman's future teachers and Yorba's junior high journalists engaged in a deep writing process to write a series of features, editorials, and news articles, all connected in some way to the overarching theme of safety. Thank you to Ms. Andrea Lopez, Ms. Tracy Knibb, and the Lloyd E. and Elisabeth H. Klein Family Foundation for supporting this project.